Morphological Disambiguation of Turkish with Free-order Co-occurrence Statistics
نویسندگان
چکیده
منابع مشابه
Tagging and Morphological Disambiguation of Turkish Text
Automat ic text tagging is an important component in higher level analysis of text corpora, and its output can be used in many natural language processing applications. In languages like Turkish or Finnish, with agglutinative morphology, morphological disambiguation is a very crucial process in tagging, as the structures of many lexical forms are morphologically ambiguous. This paper describes ...
متن کاملPlace Disambiguation with Co-occurrence Models
In this paper we describe the geographic information retrieval system developed and results achieved by the Multimedia & Information Systems team for GeoCLEF 2006. We detail our methods for generating and applying co-occurrence models for the purpose of place name disambiguation, our use of named entity recognition tools and text indexing applications. The presented system is split into two sta...
متن کاملA Hybrid Morphological Disambiguation System for Turkish
In this paper, we propose a morphological disambiguation method for Turkish, which is an agglutinative language. We use a hybrid method, which combines statistical information with handcrafted rules and learned rules. Five different steps are applied for disambiguation. In the first step, the most likely tags of words are selected. In the second step, we use handcrafted rules to constrain possi...
متن کاملLearning Morphological Disambiguation Rules for Turkish
In this paper, we present a rule based model for morphological disambiguation of Turkish. The rules are generated by a novel decision list learning algorithm using supervised training. Morphological ambiguity (e.g. lives = live+s or life+s) is a challenging problem for agglutinative languages like Turkish where close to half of the words in running text are morphologically ambiguous. Furthermor...
متن کاملMorphological Disambiguation of Turkish Text with Perceptron Algorithm
This paper describes the application of the perceptron algorithm to the morphological disambiguation of Turkish text. Turkish has a productive derivational morphology. Due to the ambiguity caused by complex morphology, a word may have multiple morphological parses, each with a different stem or sequence of morphemes. The methodology employed is based on ranking with perceptron algorithm which h...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Gümüşhane Üniversitesi Fen Bilimleri Enstitüsü Dergisi
سال: 2019
ISSN: 2146-538X
DOI: 10.17714/gumusfenbil.430034